Briefly Noted
نویسنده
چکیده
Corpora are frequently used to investigate authentic language use and a variety of largesample statistical procedures are currently employed and developed for this purpose. This book provides a largely comprehensive and succinct overview of the most commonly employed statistical procedures with their mathematical foundations and is thus useful for people interested in gaining a deeper understanding of statistical theory and practice in corpus linguistics. In a clear and precise manner, the author outlines the most common univariate and multivariate statistical procedures and discusses sample case studies to show how these procedures can be useful for corpus linguistics. The book includes chapters on statistical foundations, factor analysis, clustering techniques, and concordancing, as well as two chapters on information theory and literary detective work. The author covers a broad range of statistical techniques while outlining the major research questions and methodologies in corpus linguistics. The formatting of the book facilitates an understanding of the rather complex material, because key terms are bolded, tables are clearly and consistently laid out, and formulas are set apart from the main text using sufficient space. The key terms are further collected in a glossary that contains concise definitions and is useful as a quick reference guide and refresher. The book also includes tables of important distributions and helpful references to additional readings at the end of each chapter. Moreover, each chapter concludes with a number of exercises and their solutions are listed briefly on one page in the appendix. There are only a few aspects of the book that could be improved in future editions. First, although the author provides brief syntheses of relevant corpus studies, he often does not provide directions for the application of statistical procedures beyond the context of those studies. Applications of statistical procedures are frequently misunderstood in scholarly research because the researchers are not sufficiently informed about the underlying assumptions of the procedures and the consequences of violating them. It would thus be helpful to readers who are not familiar with a statistical procedure to include tables that briefly summarize its assumptions or lists that summarize its main goals. Second, it might be helpful to reduce some of the elaborate and detailed descriptions of computational algorithms, because nowadays statistical work typically relies on computers and statistical software that automatically perform the salient computations. It would be beneficial for the novice statistical researcher to be provided with more information about the interpretations of model parameters and the main goals of each procedure and be warned about common misconceptions about them. Finally, given the relative complexity of the concepts in the book, it would be helpful to include some of the computational steps for the exercises in the appendix along with the solutions. In summary, Oakes provides a comprehensive, principled, and mostly succinct introduction to the use of statistical procedures and their applications to investigations of large language databanks. The book is thus a useful reference guide for both corpus linguists and those who are interested in becoming one, but the reader of this book should be prepared to deal with a large array of dense material that includes numerous tables, formulas, and mathematical symbols.--Andr~ Rupp, Northern Arizona University
منابع مشابه
Simulating Societies using Distributed AI
This paper discusses the prospects for using Distributed AI techniques to support the computer simulation of societies. Newly developed ideas and techniques are reviewed, some relevant projects are briefly described, and some potential pitfalls are noted.
متن کاملThe Elastic-plastic Mechanics of Crack Extension
This paper briefly reviews progres~ in the elastic plastic analysis of crack extension. Analytical results for plane strain and plane stress deformation fields are noted, and elastic-plastic fracture instability as well as transitional behavior and combined rate and thermal effects are discussed.
متن کاملMallory's ('alcoholic') hyaline in primary biliary cirrhosis.
Mallory's (;alcoholic') hyaline has been found in hepatocytes in 18 of 70 patients with primary biliary cirrhosis. These inclusions have previously been noted in only three cases of primary biliary cirrhosis. Current views on the nature of Mallory's hyaline are briefly discussed.
متن کاملHow Far Can You Trust A Computer?1
The history of attempts to secure computer systems against threats to confidentiality, integrity, and availability of data is briefly surveyed, and the danger of repeating a portion of that history is noted. Areas needing research attention are highlighted, and a new approach to developing certified systems is described.
متن کاملEquivalence relations and behavior: an introductory tutorial.
With an emphasis on procedural fundamentals, the original behavior-analytic equivalence experiments and the equivalence paradigm are described briefly. A few of the subsequent developments and implications are noted, with special reference to the possible significance of the findings with respect to language and cognition.
متن کاملDevelopment of compound semiconductor detectors at ESA
Some examples of space-borne applications that require improvements in detector technology compared with conventional Si and Ge designs are described. Properties of compound semiconductors are noted, and a range of different detector developments are briefly reviewed. Material fabrication improvements for several compound semiconductors have resulted in near Fano-limited performance.
متن کامل